Practical Evaluation of IR within Automated Classification

Authors

  • R Dolin
  • J Pierre
  • M Butler
  • R Avedon
Abstract

This paper describes some of the work we have done to evaluate and compare the use of three IR systems (Verity, LSI, and SMART) as black boxes within an automated classification environment. We use automated classification to make a quantitative comparison of the effectiveness of the systems within this context. In so doing, we also develop criteria for the construction of a useful training set. These results lead to metrics useful in the integration of IR systems into larger applications. We conclude with an initial API for an IR component within an automated classification architecture.
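As a rough illustration of what using an IR system as a black box inside a classifier can look like, the sketch below queries a search function with the document to be classified and lets the labels of the top-ranked training documents vote, weighted by retrieval score. This is not the API proposed in the paper, which the abstract does not spell out. The names `SearchFn`, `classify`, `accuracy`, and `make_overlap_search` are hypothetical, and the word-overlap search is only a toy stand-in for a real engine such as Verity, LSI, or SMART.

```python
from collections import Counter
from typing import Callable, Dict, List, Optional, Sequence, Tuple

# Hypothetical black-box interface (an assumption, not the paper's API):
# given a query text and k, return up to k (doc_id, score) pairs, best first.
SearchFn = Callable[[str, int], List[Tuple[str, float]]]


def classify(text: str, search: SearchFn, labels: Dict[str, str], k: int = 5) -> Optional[str]:
    """Assign a label by score-weighted vote over the top-k retrieved training documents."""
    votes: Counter = Counter()
    for doc_id, score in search(text, k):
        votes[labels[doc_id]] += score
    if not votes:
        return None  # nothing retrieved, so no decision
    # Highest total score wins; ties are broken alphabetically for determinism.
    return sorted(votes.items(), key=lambda kv: (-kv[1], kv[0]))[0][0]


def accuracy(test_set: Sequence[Tuple[str, str]], search: SearchFn,
             labels: Dict[str, str], k: int = 5) -> float:
    """Fraction of test documents labelled correctly; one number per IR system allows comparison."""
    correct = sum(1 for text, gold in test_set if classify(text, search, labels, k) == gold)
    return correct / len(test_set)


def make_overlap_search(corpus: Dict[str, str]) -> SearchFn:
    """Toy stand-in for an IR engine: scores documents by the number of shared words."""
    def search(query: str, k: int) -> List[Tuple[str, float]]:
        q = set(query.lower().split())
        scored = [(doc_id, float(len(q & set(doc.lower().split()))))
                  for doc_id, doc in corpus.items()]
        scored = [pair for pair in scored if pair[1] > 0]
        scored.sort(key=lambda pair: (-pair[1], pair[0]))
        return scored[:k]
    return search


# Tiny hypothetical training set, for illustration only.
corpus = {
    "d1": "stock market prices",
    "d2": "football match score",
    "d3": "market trading shares",
}
labels = {"d1": "finance", "d2": "sports", "d3": "finance"}
search = make_overlap_search(corpus)
```

Because the classifier only sees the `SearchFn` signature, swapping in a different retrieval system means supplying a different function, and `accuracy` on a shared test set then gives one comparable figure per system.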


Similar resources

Feature Subset Selection Using a Genetic Algorithm

Practical pattern classification and knowledge discovery problems require selection of a subset of attributes or features (from a much larger set) to represent the patterns to be classified. This paper presents an approach to the multi-criteria optimization problem of feature subset selection using a genetic algorithm. Our experiments demonstrate the feasibility of this approach for feature subse...


A Scheme for Automated Classification of Images

The explosive increases in the amounts of image (and multimedia) data being generated, processed, and used in several computer applications have necessitated the development of image (and multimedia) database systems. Example applications for image databases include digital libraries, radiological image archives, satellite imagery for earth resources, law enforcement, etc. The classification of ...


CLEF-IP 2010: Retrieval Experiments in the Intellectual Property Domain

In the recent decade, research in IR methods for the Intellectual Property domain has increased. The first efforts in observing how information retrieval is done in the patent domain were made with the series of NIST workshops (see for example [2]). Lately, more workshops and conferences are dedicated to bringing together IR and IP specialists [3,8]. In 2008, the IRF obtained the agreement to coordina...


First order Gaussian graphs for efficient structure classification

First order random graphs as introduced by Wong are a promising tool for structure-based classification. Their complexity, however, hampers their practical application. We describe an extension to first order random graphs which uses continuous Gaussian distributions to model the densities of all random elements in a random graph. These First Order Gaussian Graphs (FOGGs) are shown to have severa...


Classification by progressive generalization: a new automated methodology for remote sensing multichannel data

A new procedure for digital image classification is described. The procedure, labelled Classification by Progressive Generalization (CPG), was developed to avoid drawbacks associated with most supervised and unsupervised classifications. Using lessons from visual image interpretation and map making, non-recursive CPG aims to identify all significant spectral clusters within the scene to be cla...




Publication date: 1999